A n2 RNA Secondary Structure Prediction Algorithm
نویسندگان
چکیده
Several state-of-the-art tools for predicting RNA secondary structures have worst-case time and space requirements of O(n3) and O(n2) for sequence length n, limiting their applicability for practical purposes. Accordingly, biologists are interested in getting results faster, where a moderate loss of accuracy would willingly be tolerated. For this reason, we propose a novel algorithm for structure prediction that reduces the time complexity by a linear factor to O(n2), while still being able to produce high quality results. Basically, our method relies on a probabilistic sampling approach based on an appropriate stochastic context-free grammar (SCFG): using a well-known or a newly introduced sampling strategy it generates a random set of candidate structures (from the ensemble of all feasible foldings) according to a “noisy” distribution (obtained by heuristically approximating the inside-outside values) for a given sequence, such that finally a corresponding prediction can be efficiently derived. Sampling can easily be parallelized. Furthermore, it can be done in-place, i.e. only the best (most probable) candidate structure generated so far needs to be stored and finally communicated. Together, this allows to efficiently handle increased sample sizes necessary to achieve competitive prediction accuracy in connection with the noisy distribution.
منابع مشابه
PreRkTAG: Prediction of RNA Knotted Structures Using Tree Adjoining Grammars
Background: RNA molecules play many important regulatory, catalytic and structural <span style="font-variant: normal; font-style: norma...
متن کاملVisualization of RNA secondary structures using highly parallel computers
Results of RNA secondary structure prediction algorithm are usually given as a set of hydrogen bonds between bases. However, we cannot know the precise structure of an RNA molecule by only knowing which bases form hydrogen bonds. One way to understand the structure of an RNA molecule is to visualize it using a planar graph so that we can easily know the geometric relations among the substructur...
متن کاملRNA-SSPT: RNA Secondary Structure Prediction Tools
The prediction of RNA structure is useful for understanding evolution for both in silico and in vitro studies. Physical methods like NMR studies to predict RNA secondary structure are expensive and difficult. Computational RNA secondary structure prediction is easier. Comparative sequence analysis provides the best solution. But secondary structure prediction of a single RNA sequence is challen...
متن کاملRNA Secondary Structure Prediction by Optimization Technique: Genetic Algorithm
RNA Secondary Structure prediction is one of the most significant research areas in bioinformatics. This paper presents a RNA secondary structure prediction using the permutation-based Genetic Algorithm (GA). We analyse the selection function STDS and Keep-Best Reproduction (KBR) genetic algorithm replacement techniques, and also using the genetic algorithm crossover operators. We take the SsrS...
متن کاملApproximation Scheme for RNA Structure Prediction Based on Base Pair Stacking
Pseudoknotted RNA secondary structure prediction is an important problem in computational biology. Existing polynomial time algorithms have no performance guarantee or can handle only limited types of pseudoknots. In this paper for the general problem of pseudoknotted RNA secondary structure prediction, a polynomial time approximation scheme is presented to predict pseudoknotted RNA secondary s...
متن کامل